#proof generation23/06/2025
VERINA Benchmark: Pushing the Limits of Verifiable Code Generation with LLMs
VERINA introduces a holistic benchmark for evaluating LLMs on verifiable code generation, combining code, formal specifications, and proofs across diverse difficulty levels.